AN IMPROVED SUM-PRODUCT INEQUALITY IN FIELDS OF 

PRIME ORDER 



MISHA RUDNEV 

Abstract. This note proves the sum-product inequality 
max(|A ■ A\, \A + A\) > |A| 1+ tt 



log 11 |A| 

for sets of cardinality \A\ < y/p in prime fields F p . 



1. Introduction 



Let A be a subset of the multiplicative group F* of the prime residue field F p (p 
is thought to be large, the constants throughout being independent of p). The sum 
set A + A is defined as {a± + 02 : a±, G A}; in the same fashion one defines the 
difference, product, and ratio sets A — A, A ■ A, A : A, respectively. 

The sum-product estimate of Bourgain, Katz, and Tao ([2]) and Konyagin ([5]) 
establishes that as long as \A\ < p 1_A , one has 

(1.1) max(\A + A\, \A-A\) > c(A) \A\ l+5( - A \ 

for some c(A) and <5(A) > 0. In what follows, finite set cardinalities are denoted via 
I • |; c and C denote some universal constants, changing from one place to another. 

The nest question, along the lines of the Erdos-Szemeredi conjecture for the in- 
tegers, is to establish quantitative estimates as to the relation 5(A). Clearly, there 
is no no-trivial sum-product estimate in the case A = 0, i.e when the size of A is 
commensurate with the size of the whole field. Therefore, in order to get a uniform 
quantitative estimate, one should impose a restriction on the size of A, which is 
usually \A\ < ^/p, i.e A > ^, which is assumed further. (When A becomes smaller 
than ^, calculations involving exponential sums start yielding non-trivial results, 
rather quickly, as A decreases, overpowering the combinatorial estimates that this 
and the majority of the quoted papers deal with. See e.g. [4j- The case \A\ > ^/p is 
no longer discussed in the sequel.) 

Garaev [3] succeeded in obtaining the first quantitative sum-product estimate in 
¥ p : either the sum or the product set shall have cardinality |A| 1+ T4 , up to a multiple 
of a power of clog|A|. Katz and Shen ([6]) elaborated on a particular application 
of the Plunnecke-Ruzsa inequality (cited here as Lemma [T|) in Garaev's proof and 
improved to j^. Bourgain and Garaev (PQ) incorporated a covering argument 
(whose variant, taken from [9], is cited here as Lemma [2]) and improved ^ to 
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And Li ([7|) showed that a multiple of a power of log \A\ can be done away with, so 
today's state-of-the art, for \A\ < yfp is that 

max(|^-A|, |A + A]) > c|A| 1+ ^, 

for some absolute c. Moreover, one can replace either one or both the product set 
A ■ A with the ratio set A : A and the sum set A + A with the difference set A — A. 

On the heuristic level, the proof of the above series of results effectively begins 
with assuming that the multiplicative energy of A is large, which follows from the 
assumption that the product set (or the ratio set) is small. Then one chooses a 
"popular" dyadic interval, with approximately constant number of points of A x A 
on the lines through the origin in ¥ p x F p , corresponding to different ratios. The 
decision on the meaning of "popular" can be postponed to the end of the proof and 
depend on what the final estimates are. This fact that the notion of "popularity" 
could be adjusted to the final estimates, rather specified this in the "natural" way in 
the beginning of the proof was the observation due to Li (j7j), which enabled him to 
get rid of the factor log \A\ which permeated the results preceding his. Throughout 
the main body of the proof one uses the "standard" arithmetic combinatorics tech- 
nique: Pliinnecke-Ruzsa inequalities, covering lemmas and additive pivoting, which 
is roughly speaking estimating the size of A + rA for some dilation r, in order to 
bring the sum or the difference set into play (as one wishes, since the choice of A ±A 
is allowed in both Pliinnecke-Ruzsa inequalities and the Covering lemma). 

The technical issues involved in the estimates used to improve Garaev's original 
jl to j2 have piled up, and this note offers yet another one, which enables to reduce 
"the worst possible scenario" estimating the size of A + rA in the state-of-the-art 
literature, (that is this case "to blame" for the final answer ^) to the "normal case" 
which would yield only if it were shown to be the only scenario. 

This is done essentially via an application of the pigeonhole principle in Lemma 
[3] below which is used in the case study in the forthcoming proof. 

The main result is as follows. 

Theorem 1. Let A C F* with \A\ < ^/p and p bigger than some absolute constant. 
Then, for some absolute c, 

m.zx.(\A-A\, \A + A\) > c\A\ l+ Ti T . 

logTT \A\ 

Remark 2. One can replace either one or both the product set A ■ A with the ratio 
set A : A - in which case the logarithmic factor disappears - and the sum set ^4 + ^4 
with the difference set A — A. 

Notation-wise, the symbols <C, 3>,ss suppress constants. E.g \X\ S> |y| means 
\X\ > c\Y\, for some c. The English language gets abused in accordance with these 
notations by saying "at least" or "at most" in the sense conveyed by the symbols 
<C, respectively. To avoid confusion, "at least" and "at most" used in this way 
will be enclosed into quotation marks. E.g. saying that |X| is "at least" \Y\ means 

\x\ » in 
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2. Lemmata 

This section contains the two arithmetic combinatorics lemmas used. The 
first one is a slight, but very useful generalisation of the Pliinnecke-Ruzsa inequality 
(see [8]), which is due to Katz and Shen (|6j). 

Lemma 1. Let Y\ Xx, . . . be additive sets. Then for any < e < 1, there exists 
a subset \Y'\ C Y , with \Y'\ > (1 — e)|Y|, and some constant C(e), such that 

(2-1) \Y' + X l + ... + X k \ < C(e) ^yjl-t^ 1 - 

The second one is a covering lemma, see e.g. pQ, [9], [7]. 

Lemma 2. Let X\ and X<i be additive sets. Then for any e £ (0, 1) and some 

C(e) 

constant C(e), there exist -min(|Ai + A2I, \X% — X%\) translates of X2, whose 

\X 2 \ 

union contains not less than (1 — e)|Ai| elements of X\. 

3. Proof of Theorem CD 

The proof goes essentially along the lines of the lately well-established approach, 
which in particular does not distinguish between the A ± A, and applies as well to 
A : A, in which case it gets a little easier. If at the outset one deals with the product 
set, then the analysis begins by looking at the multiplicative energy whereupon it 
starts essentially dealing with ratios, rather than products. Throughout, one uses 
Lemmas [T] and [2] which admit any ± sign variation, and uses additive pivots A + rA, 
although it can be A — rA instead. Hence, what follows is confined to the sum and 
the product sets only. Suppose both |A + A| < K |A| and |A • A| < if |A|, for some 
K. 

First off, let us refine A, if necessary, so that the additive Pliinnecke-Ruzsa in- 
equality applies to A in full power as follows: 



(3.1) |A + A + A + A| < K 3 \A\. 

This will be used in the end of each of the Cases (i-iii) constituting the main body 
of the proof. 

Indeed, if (13. ip did not apply to A, one would, by Lemma dj choose a large subset 
A' of A (containing, say 90% of its elements) such that with A' replacing A, the 
estimate (13. ip would be in place, and proceed with A'. As several more refinements 
will be made in the sequel, this first one is adopted without the change of notation: 
A from now on stands for the possibly refined original set, which satisfies (13. ip as 
well as the assumptions |A + A| < K\A\ and |A • A| < K\A\. 
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The multiplicative energy of E*(A), i.e the number of ordered quadruples 
(01,02,03,04) EAxAxAxA, satisfying the equation 

a\ 04 
a 3 a 2 

is, by Cauchy-Schwarts, bounded from below as follows: 

(3.2) E , {A)> _^ >> \^. 

The multiplicative energy, by its definition, is 

E*(A) = J2n 2 (0, 

where £ is the slope of a line through the origin in F^, and n(£) is the number of 
points of A x A on this line. 

Let us partition these lines, identified by their slopes, into dyadic groups as to 
how populated by points of A x A they are: a line in the jth group, j = 1,2,... < 
1 + log 2 \A\ will support 2- ?_1 < n(£) < 2- ? points of Ax A. We shall consider a single 
"popular" dyadic group, to be specified in the end of the proof (although the choice 
will be straightforward). This dyadic group contains some number L of lines, each 
supporting approximately N points of A x A, rendering the contribution M into the 
multiplicative energy E*(A). One has 

(3.3) LN 2 w M. 
This implies, since LN <C \A\ 2 , and N < \A\, that 

M 

(3.4) L,iV» 



14 



2 ' 



Let S denote the set of slopes of the lines through the origin in the chosen dyadic 
group and P denote the set of points of A x A supported on these lines. Since each 
line supports approximately the same number N of points, one has 

(3.5) \P\ » LN. 

Let 

Ax= {y:(x,y)€P}, 
1 ' ' A y = {x:(x,y)GP}. 

Namely A x is the set of ordinates of points of P all having the same abscissa x and 
A y the set of abscissae of points of P all having the same ordinate y. For y e A x 
let ^4y/ x = A^ (with £ = ^ ) be the set of abscissae of the points of P supported on 
the line through the origin with the slope £ = f passing through (x,y). So, for all 
y G A x , \A y/x \ w iV. 

The next lemma is an important building block for the proof. 
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Lemma 3. There exist a popular abscissa x = x and a popular ordinate y = y, i.e. 

bset A 

LM 



such that \Ax\,\Ay\ ;§> -j^p as well as a subset A% C A x , with 



(3.7) \A X \ » 



such that for every y E A x , one has 

, ^ , 7 , LAIN 

(3.8) \A y = A y/i nAy\^>^^. 

Proof of Lemma 0. To be precise, one must run some popularity arguments about 
the sets P and 5. In essence however, since \P\ ~ LN and the maximum population 
of a line with slope in S is "at most" N, this means that lower bounds pertaining to 
sums over the sets S, P also apply to the whole hierarchy of their "popular subsets" , 
for merely the price of the constants hidden in the 3> symbols getting worse. 

By the pigeonhole principle, a positive proportion of the points of P have "popular 
ordinates", namely such that a horizontal line passing through such an ordinate 
contains "at least" points of P. Let A' denote the set of these popular ordinates. 

Let us call P' the set of the points of P having popular ordinates. One still has 
\P'\ 3> LN and P' is still supported on no more than L lines passing through the 
origin. Therefore, P' still carries a positive proportion of the multiplicative energy 
M ~ LN 2 of P, and an average line (identified by its slope in £ E E) still has ~ N 
points of P'. 

Now the abscissae of the points in the set P' can be pruned in the same way, 
down to the "popular" ones, namely such that there are "at least" ^# points of P' 
supported on a vertical line with a popular abscissa, whereupon P' gets refined to 
P" . One can now disregard the lines that may have become "poor", namely those 
supporting fewer than cN points of P" . The remaining lines have a set of slopes 
3', whose cardinality is still "at least" L, and each of these lines still supports « N 
points of P" . 

The claims of the Lemma are basically the average case for the sum 
(3-9) £= \A z/x nA y \, 

x£A",y£A' z£A x 

Let us write z ~ x if (x, z) is a point of P. Then 

£> Yl \A z/x nA y \. 

Rearranging the summations by fixing the ratio C = f an d confining the sum 
only to the "popular" ratios in H' (these ratios each have "at least" N realisations 
in terms of "popular" ordinates and abscissae) one gets: 

(3.10) E»iV^ ^ \ A 'i nA vl 

£eH' y&A' 

The sum in the right-hand-side of (|3.9p takes place over "at least" L lines whose 
slopes are popular, with respect to the refined point set P", i.e. each line supporting 
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"at least" N points of the latter point set. Fixing a particular line I, let A\ be the 
set of the (popular) abscissae of the points of P" on this line. One has \A\\ 3> N, 
as well as A\ C A", the set of popular abscissae. This means, by construction of the 
sets A', A", that each element of A\ belongs to "at least" -r^r different sets A y , with 

y e A'. 

Hence, 

(3.11) E>>A ,. L .yv.^»^. 

1^1 1^1 

Now the claims (|3.8p and ()3.7p follow from (j3.9H3.ll]) by the pigeonhole principle. 
Indeed, there is a pair of (x,y) £ A" x A', such that 

LMN 



(3.12) ]T L4 z/£ n^|> 



One can restrict the summation in (|3.12|) to those values of z only, for which 

LMN 

\A z/x nA y \ > — p 

These z are to form the set A%, whose existence is stated in the Lemma. Since the 
maximum size of a single A z j x is "at most" N, then the number of such z is "at 
least" LM. □ 

A can be now scaled, so without loss of generality assume 5 = 1, and to save 
on the notations let Ax = B (and scale no more). Let us also us further use the 
notations C for the set Ay and B for the refinement of B, so that (j3.8|) holds for 
every z 6 B. Heuristically, C catches a large proportion of the abscissae of points 
of P supported on every line through the origin, whose slope is in B. 

With the parameter r S F*, consider now (as study) "additive pivots" 

C + rC, B + rB, as well as finally C + rC p , where C p will be subset of C containing 
"many" abscissae of points of P lying on the particular line to be specified, with the 
slope p G B. 

Introduce the following notation: for a pair of sets Si, S2 C ¥ p , let 

(3.13) R(Si,S 2 ) = \ - — 7 : p,q,e Si, s,t e S 2 , p ^ q,s ^ t 

{ s — t 

One now faces with three cases that deal with the sets R(C,C), R(B,B), and 
R{C, C p ), where C p is the above-mentioned subset of C, to be chosen. Case (i) and 
(ii) will constitute the normal case, where the estimates somewhat repeat what has 
been done in the state-of-the-art literature. These estimates, from [3] on, it would 
give a better estimate securing the exponent yj, rather than 7^ with the current 
state of the art. But there would always be the "worst possible scenario" which 
forced the denominator in the final answer to become by one bigger than rendered 
by the normal case. This "worst case scenario", however, gets replaced below by 
Case (iii) which gives a similar estimate, as in the normal case. 
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Recall, by Lemma ([3]), 

LM LN 
(3-14) \B\ » w , \C\ » w . 

Case (i): Suppose R(B,B) / i?(C, C). Then there are two possibilities: 

Case (i.l): There is r = 2^2 e R(B,B), such that r is not in R(C,C). Fix this 
r along with a quartet p, s, t representing it throughout the rest of Case (i.l). 
Then the equation 

(3.15) ai + ra 2 = a 3 + ra 4 , 

where ai, . . . , 04 G C, has only trivial solutions, i.e those with a 2 =04. Thus for any 
C C C, which contains a positive proportion of the elements of C, one has 

(3.16) \C' + rC'\ > |C| 2 . 



Therefore, 



(3.17) |C7| 2 < 



s — t 



<C |pC'-gC" + aC"-tC"|. 



But p, g, s, t are elements of B, and hence of the set of slopes S. Therefore one can 
bound the sizes of the corresponding dilates of C from above, using Lemma [2j 

Lemma [2] specifies the refinement C' of C which is chosen in such a way that the 
sets involved get covered by translates of A not merely in their large proportion as 
guaranteed by the Covering lemma, but by 100%. 

Recall the notation A^ <Z A for the set of the abscissae of the line with the slope 
£. For any £ G S, 99% of the set £C can be covered by "at most" 

(3 - 18) ^v - <<: ~aT 

translates of ^A^ C A (which have cardinality approximately iV), and therefore 
translates of A itself. Now one can choose C C C, appearing in (|3.16p as a subset 
containing at least every second element of C, and such that the sets pC, —qC, 
sC, —tC, each get fully covered (independently) by "at most" translates of A 
each. Which means, the whole set (s — t)C + {jp — q)C has been fully covered by 

"at most" (J^A translates of A + A + A + A. 
Together with (|3.16p this implies 

(3.19) iV 4 |C7| 2 < K 4 \A\ 4 \A + A + A + A\ < K 7 \Af, 

after using (|3.1|) . 

Case (i.2): There is r = 2=2 e R(C,C), such that r is not in R(B,B). Fix 
this r along with a quartet p,q,s,t representing it. Recall that C = Ay, the set 
of abscissae with a popular, with regard to P, ordinate y. Thus 2, | ,|, | are the 
slopes in H. And so are, since the point set P is symmetric with respect to the 
bisector y = x, the ratios £ p = |, £ q = |, £ s = |, and & = |. 
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Thus 

(3.20) r = 

£s - « 

Repeat now the argument of Case (i.l) from (13.16P through (|3.19j) replacing there 
C with B and (p, g, s, t) with £ 9 , £ s , £t). This yields the analogue of the estimate 

(prrep : 



(3.21) iV 4 |^| 2 < K 4 |A| 4 |A + ^ + ^ + ^1 < #V| 5 - 

The estimate f)3.21|) is slightly worse than (|3.19p . since the estimate on the size of 
B in Lemma is more restrictive than on the size of C, see (|3.14|) . So the estimate 
(|3.19p will not be returned to. 

By Lemma [3l one has \B\ > and as M « LiV 2 , (f3T2TT) implies that 

(3.22) M 4 < KVl 11 - 

From now on let us assume that i?(C, C) = R(B, B) = R. 

Case (ii): Suppose \R\ 3> min(|C| 2 ,p). The original assumption |A| < y/p ensures 
that the minimum is achieved when \R\ 3> \C\ 2 . For r € R, let E r (C) denote the 
number of ordered quadruples (ai, <22, 03, 04) G CxCxCxC satisfying the equation 
(|3TT5jl . 

The equation (I3.15P has trivial solutions - when 02 = 04 - and non-trivial ones - 
when (01, ci2, 03, 04) determines r. Summing over all r, one has, under the Case (ii) 
assumption: 



(3.23) Yl E ^ C ) = \ R W C \ 2 + l^l 4 « \ R \\°\ 2 - 

r&R 

Hence, there exists r £ R, such that E r (C) <C |C| 2 . Fix this r throughout the rest 
of Case (ii), together with some quartet p,q,s,t E C representing it. Observe that 
the same bound is certainly satisfied by E r (C), namely when the equation ()3. 15[) is 
restricted to a subset C C C. If C contains a positive proportion of the elements 
of B, then by Cauchy-Schwartz, 

(3.24) \C' + rC'\ > |Cf. 

Now use the representation (I3.20p for r and repeat verbatim - with (£ p , £ 9 ,£ s , &) 
replacing (p,q,s,t) there - the argument from (I3.16f) on within Case (i.l). 

From now on assume that R(C,C) = R(B,B) = R and \R\ <C |C| 2 - This takes 
one into the final Case (iii) - the analog of worst possible scenario in the former 
literature which turns out to be a regular one here. This, together with Lemma [3l 
is central for the proof. 

Case (iii): There exists some r = 2=2 £ R, such that r + 1 R (otherwise R = ¥*, 
and the condition \A\ < y/p, returns one into Case (ii)). Fix this r, together with a 
quartet p,q,s,t E B representing it. 
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Consider now the quantity p G B and the set C p , a subset of C, furnished by 
Lemma [3j with "at least" L y^i elements (set in (|3,8p y = p and recall that x = 1 

after scaling). The members of C p are the abscissae of points of P, supported on 
the line through the origin with the slope p. They also lie in the set C. By the 
assumption of Case (iii) one has R(C, C) = R(B, B) = R, and it follows that since 
r + 1 is not in R, it is not in R(C, C p ) C R. 

Thus, using Lemma El which guarantees that |C| 3> TjfT> as weu as P-8p . for any 
positive proportion subsets C, C' p of C, C p , respectively, one has 

(3.2 5 ) + + 

Let us use Lemma [1] in the following form: for any positive proportion subset C 
of C (C is to be specified later, in the ensuing covering argument), there is C" C C", 
containing, say 99% of the members of C, and such that, by (|3.25p . one has: 



LN LMN 
\A\ |A| 4 



< \C" + (r + 1)C'\ < 



c" + c£ + 



(3.26) 



|C" 

K\A\ 

~\C\ 

K\A\ 
\C\ 



c' + 



p 



(s-t)C + {p-q)C' p 
sC - tC -qC'+A 



Indeed, pC p C A, since p £ H is a slope and C p is a subset of the abscissae of the 
points of P supported on the corresponding line through the origin. In comparison 
to the above Cases (i,ii), the covering argument is now to be applied only three 
rather than four times. 

The covering argument now proceeds in exactly the same way as it was in Cases 
(i,ii), since s, t, q are members of B C H. The subsets C of C, as well as C' p of C p are 
chosen so that their dilates sC, —tC, —qCL are each fully covered (independently) 
by "at most" translates of A. (E.g. —qC' p gets covered by no more than 



-qC' p -qA q 
\A q 



K\A\ 
N 



translates of qA q C A, and therefore A itself.) 



Then (13361) yields 



LMN f LN\ 



iV 3 < K 4 \A\ 4 \A + A + A + A\ < K 7 \A\ 



by d3JB . Since M 



V 1^1 / 

LN 2 , this means: 



M 4 < K 7 \A 



ii 



restating the estimate (|3.22p . which therefore shall be taken as the final one in the 
above Case study. 
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To conclude the proof, all one has to do is choose M. In view of the latter 
estimate, which only contains the multiplicative energy M, one can only do the 
straightforward pigeonholing 

(3.27) M»^» 



log \A\ K]og\A\' 

since by the pigeonhole principle, there is a dyadic group of lines through the origin 
furnishing "at least" the above amount of multiplicative energy. 
Therefore 

(3-28) K»(-^" U 

\log \A\ 

This completes the proof of Theorem [TJ □ 

The final remark is that having in the outset the ratio set A : A, rather than the 
product set A ■ A makes the multiplicative energy argument superfluous and matters 
more transparent. S would be denote the set of "popular lines" with the slopes in 
3, each supporting "at least" ^ points of A x A. This rids one of the necessity 
of using dyadic pigeonholing with the set of slopes and as the result, there is no 
logarithm in the final estimate. Altogether the lines with the slopes in H support a 
positive proportion of the set A x A. The sets B,B, and C then become positive 
proportion subsets of A, and the relations affecting the left-hand side of the estimate 
(|3.22p would assume the simpler form LN > \A\ 2 , N > 
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